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Abstract — The performance of iterative decoding techniques 
for linear block codes correcting erasures depends very much 
on the sizes of the stopping sets associated with the under- 
lying Tanner graph, or, equivalently, the parity-check matrix 
representing the code. In this paper, we introduce the notion 
of dead-end sets to explicitly demonstrate this dependency. The 
choice of the parity-check matrix entails a trade-off between 
performance and complexity. We give bounds on the complexity 
of iterative decoders achieving optimal performance in terms 
of the sizes of the underlying parity-check matrices. Further, 
we fully characterize codes for which the optimal stopping set 
enumerator equals the weight enumerator. 

Index Terms — Dead-end set, iterative decoding, linear code, 
parity-check matrix, stopping set. 



I. Introduction 

ITERATIVE decoding techniques, especially when applied 
to low-density parity-check (LDPC) codes, have attracted 
a great attention recently. In these techniques, decoding is 
based on a Tanner graph determined by a parity-check matrix 
of the code, which does not necessarily, and typically does 
not, have full rank. It is well known that the performance 
of iterative decoding algorithms in case of binary erasure 
channels depends on the sizes of the stopping sets associated 
with the Tanner graph representing the code [3], Several 
interesting results on stopping sets associated with Tanner 
graphs of given girths are given in [8], [11]. There are more 
specific results for classes of codes represented by particular 
Tanner graphs, see, e.g., [7], [1], [15], as well as more general 
results pertaining to ensembles of LDPC codes, see e.g., [2], 
[3], [6], [12]. 

In this paper, we define the notion of dead-end sets to 
explicitly show the dependency of the performance on the 
stopping sets. We then present several results that show how 
the choice of the parity-check matrix of the code, which deter- 
mines decoding complexity, affects the stopping and the dead- 
end sets, which determine decoding performance. Our study 
differs from the aforementioned studies, but agrees with the 
studies by Schwartz and Vardy [13], Hollmann and Tolhuizen 
[5], Han and Siegel [4], and Weber and Abdel-Ghaffar [14], in 
its focus on the relationship between the stopping sets on one 
hand and the underlying code representation, rather than the 
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code itself, on the other hand. Since linear algebra is used to 
study this relationship, for our purpose, parity-check matrices 
are more convenient than the equivalent Tanner graphs for 
code representation. 

Let C be a binary linear [n, k, d] block code, where n, k, and 
d denote the code's length, dimension, and Hamming distance, 
respectively. Such a code is a /c-dimensional subspace of the 
space of binary vectors of length n, in which any two different 
elements differ in at least d positions. The set of codewords 
of C can be defined as the null space of the row space of an 
r x n binary parity-check matrix H = (hij) of rank n — k. 
Assuming all rows in H are different, 



- k < r < T~ 



(1) 



The row space of H is the [n, n — k, d ] dual code C of C. 

The support of a binary word x = {x\, X2, ■ ■ ■ , x n ) is the set 
{j : Xj 0} and the weight of x is the size of its support. For 
the zero word = (0, 0, ... , 0), the support is the empty set, 
0, and the weight is zero. Since a binary word x is a codeword 
of C if and only if xH T = 0, the parity-check matrix H gives 
rise to r parity-check equations, denoted by 



PCE,(x) : 



=1 hijXj = for z = 1,2,. 



(2) 



An equation PCEi(x) is said to check x in position j if and 
only if hij = 1. 

On the binary erasure channel, each bit of the transmitted 
codeword is erased with probability e, while it is received 
correctly with probability 1 — e, where < e < 1. For a 
received word r = (r±, T2, ■ ■ ■ , r n ), the erasure set is 



£ r = {] ■ r, ± 0, 1}. 



(3) 



A received word can be decoded unambiguously if and only 
if it matches exactly one codeword of C on all its non-erased 
positions. Since C is a linear code, this is equivalent to the 
condition that the erasure set £ r does not contain the support of 
a non-zero codeword. If £ r does contain the support of a non- 
zero codeword, then it is said to be incorrigible. A decoder for 
C which achieves unambiguous decoding whenever the erased 
set is not incorrigible is said to be optimal for the binary 
erasure channel. An exhaustive decoder searching the complete 
set of codewords is optimal. However, such a decoder usually 
has a prohibitively high complexity. 

Iterative decoding procedures may form a good alternative, 
achieving close to optimal performance at much lower com- 
plexity [9], in particular for LDPC codes. Here, we consider 
a well-known algorithm, often expressed in terms of a Tanner 
graph, which exploits the parity-check equations in order to 
determine the transmitted codeword. Initially, we set c = r and 
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£ c = £ r . If PCEi(c) checks c in exactly one erased position 
j*, then we use (|2ji to set 



5> 



(4) 



and we remove j* from the erasure set £ c . Applying this 
procedure iteratively, the algorithm terminates if there is no 
parity-check equation left which checks exactly one erased 
symbol. Erasure sets for which this is the case have been 
named stopping sets [3]. In case the final erasure set £ c is 
empty, the iterative algorithm retrieves all erased symbols, and 
thus the final word c is the transmitted codeword. In case the 
final erasure set £ c is a non-empty stopping set, the iterative 
decoding process is unsuccessful. The final erasure set £ c is 
the union of the stopping sets contained in £ r , and thus £ c is 
empty if and only if £ r contains no non-empty stopping set. 
Therefore, we introduce the notion of a dead-end set for an 
erasure set which contains at least one non-empty stopping 
set. In summary, on the binary erasure channel, an optimal 
decoder is unsuccessful if and only if £ r is an incorrigible set, 
and an iterative decoder is unsuccessful if and only if £ r is a 
dead-end set. 

This paper is organized as follows. In Section ILTI we charac- 
terize codeword supports, incorrigible sets, stopping sets, and 
dead-end sets in terms of a parity-check matrix and derive 
basic results from this characterization. We also review results 
from [13] and [5] which are most relevant to this work. Dead- 
end sets and stopping sets are studied in Sections fTTTI and HV1 
respectively. Conclusions are presented in Section IVl 

II. Definitions and Preliminaries 

Again, let C be a linear binary [n, k, d] block code with an 
r x n binary parity-check matrix H = (hij) of rank n — k. 
Let S be a subset of {1,2,..., n}. For any H and S, let Hs 
denote the r x |<S| submatrix of H consisting of the columns 
indexed by S. A set S is the support of a codeword if and 
only if all rows in H5 have even weight, i.e., if and only if 



\{j € S : h id = 1}\ = 0(2) Vi = l,2, 



(5) 



A set S is an incorrigible set if and only if it contains the 
support of a non-zero codeword. A set S is a stopping set for 
the parity-check matrix H if and only if H5 does not contain 
a row of weight one, i.e., if and only if 



\{j eS:h i , j = l}\^l Vi = l,2,. 



(6) 



Hence, the support of any codeword is a stopping set. A set 
S is a dead-end set for the parity-check matrix H if and only 
if it contains a non-empty stopping set. 

The polynomial A(x) = X)™=o AiX 1 , where A% is the num- 
ber of codewords of weight i, is called the weight enumerator 
of code C. Similarly, I(x) = Y^i=o ^i x% ^ where U is the 
number of incorrigible sets of size i, is called the incorrigible 
set enumerator of C. Clearly, 



d = min{« > 1 : A4 > 0} = min{i > : U > 0} (7) 



and 



A, = 



1 if i = 0, 
if 1< i < d 



(8) 



The incorrigible set enumerator satisfies 

if < i < d - 1, 
A, i£i = d, 

(") ifn-fc+l<i<n, 



(9) 



where the last property follows from the observation that any 
set S of size |<S| > n — k contains the support of a non-zero 
codeword as the rank of H5 is at most n — k. 

The polynomials S(x) — X)"=o where Si is the 

number of stopping sets of size i, and D(x) = Y^7=o ®iX % , 
where Di is the number of dead-end sets of size i, are called 
the stopping set enumerator and the dead-end set enumerator, 
respectively, of parity-check matrix H. From the observation 
that (13 and are equivalent for sets S with |<S| < 2, it 
follows that 



Si = At and A = h if < i < 2. 



(10) 



In particular, So = 1, Si = S2 = 0, and D — D x = D 2 = 
for any parity-check matrix of a code of minimum distance 
d > 3. 

Let s denote the smallest size of a non-empty stopping set 
(and thus the smallest size of a dead-end set), i.e., 

s = min{i >l:S l >0} = min{i > : A > 0}. (11) 

The number s is called the stopping distance for the parity- 
check matrix H in [13]. For any parity-check matrix H of a 
binary linear [n, k, d] block code C, it holds that the stopping 
set enumerator satisfies 



1 if i = 0, 
if 1 < i < s - 1, 
( n ) if n - d 1 - + 2 < i < n. 



(12) 



where the first property follows from i ll Ob and (|8j, the second 
property follows from the definition of s, and the third property 
follows from the fact that the weight of any row in H5 is 
either or at least equal to d 1 - — (d — 2) = 2 for any S with 
|<S| > n — d 1 - + 2. Further, again for any parity-check matrix 
H, it follows from the definitions of the various enumerators, 
(|9j, and d!2l >. that the dead-end set enumerator satisfies 

( if < i < s - 1, 
Di = \ Si if i = s, (13) 
[ (") if n - k + 1 < i < n. 

For code C on the binary erasure channel, the probability 
of unsuccessful decoding (UD) for an optimal (OPT) decoder 
is 



^u D PT (c) = £w- e ) r 



I d e d = A d e d . 



(14) 



i=d 



Similarly, the probability of unsuccessful decoding for an 
iterative (IT) decoder based on parity-check matrix H is 



^u T d(H) 



i—s 



D s e s = S s e s 



(15) 



Hence, these two probabilities are completely determined by 
the incorrigible and dead-end set enumerators. Notice from 
J14b and H5\ that iterative decoding is optimal if and only 
if D(x) = I(x). At small erasure probabilities, P°^ T (C) 
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and Pup(H) are dominated by the terms A d t d and S s e s , 
respectively. Actually, for sufficiently small values of e, the 
parameters d and s are the most important parameters char- 
acterizing the performance of optimal decoding and iterative 
decoding, respectively. In (II Oi it is stated that if i < 2, then 
Si = Ai, Therefore, s = d for any parity-check matrix H of 
a code with d < 3, which is derived as Theorem 3 in [13]. 
Here, we show that this cannot be extended further. 

Theorem 1: For any code C with Hamming distance d > 4, 
there exists a parity-check matrix H for which s = 3. 

Proof: We may order the positions so that C has a codeword 
composed of d ones followed by n — d zeros. In particular, 
the first d columns in any given parity-check matrix of C are 
linearly dependent, but no d — 1 columns are such. The row 
space of the submatrix composed of these first d columns has 
dimension d — 1 and a sequence of length d belongs to this 
row space if and only if its weight is even. By elementary row 
operations, we can obtain a parity-check matrix of the form 



H 



H' 
H" 



(16) 



for some matrices H' and H" of appropriate sizes, where 
is the (d — 1) x d matrix given by 



H' 



/ 1 1 
110 
11110 
1 1 



\ 



\ 









(17) 



1 1 / 



Clearly, S = {1,2,3} is a stopping set for H as no row of 
Us has weight one. ■ 

Contrary to the weight enumerator and the incorrigible set 
enumerator, which are fixed for a code C, the stopping and 
dead-end set enumerators depend on the choice of the parity- 
check matrix H. Theorem^shows that no matter how large the 
Hamming distance of the code is, a bad choice of the parity- 
check matrix may lead to very poor performance. Therefore, 
it is important to properly select the parity-check matrix of a 
code when applying iterative decoding. 

Clearly, adding rows to a parity-check matrix does not 
increase any coefficient of the stopping set enumerator or the 
dead-end set enumerator. On the contrary, these coefficients 
may actually decrease at the expense of higher decoding 
complexity. The rows to be added should be in the dual 
code C of C. By having all 2™~ fe codewords in C as 
rows, we obtain a parity-check matrix that gives the best 
possible performance, but also the highest complexity, when 
applying iterative decoding. Since the order of the rows does 
not affect the decoding result, we refer to such matrix, with 
some ordering imposed on its rows which is irrelevant to 
our work, as the complete parity-check matrix of the code C, 
and denote it by H*. Its stopping set enumerator is denoted 
by S*(x) = Y^i=o S* x% > its dead-end set enumerator by 
D*(x) = J2i=oD*x l , an d its stopping distance by s*. Since 
the support of any codeword is a stopping set for any parity- 



check matrix, we have 

Si > S* > A, and D 4 > D* > h Mi = 0, 1, . 



(18) 



Consequently, s < s* < d, and S*(x) and D*(x) are called 
the code's optimal stopping set enumerator and optimal dead- 
end set enumerator, respectively. Schwartz and Vardy [13] have 
shown that 

s*=d (19) 

and the results derived recently by Hollmann and Tolhuizen 
[5] imply, in addition, that 



and 



S* d = A d 



D*(x) = I{x). 



(20) 



(21) 



Actually, Schwartz and Vardy [13] have shown that, for 
d > 3, it is possible to construct a parity-check matrix 
with at most Y^=i ("7^) rows f° r which s = d. They 
also obtain interesting results on the minimum number of 
rows in a parity-check matrix for which s = d. They obtain 
general bounds on this minimum number, which they call the 
stopping redundancy, as well as bounds for specific codes 
such as the Golay code and Reed-Muller codes. Han and 
Siegel [4] derived another general upper bound on the stopping 
redundancy for d > 2 given by £l=i~ 1)/21 

Hollmann and Tolhuizen [5] specified rows that can be 
formed from any (n — k) x n parity-check matrix of rank 
n — k to yield a parity-check matrix for which Di = Ii 
for < i < m, where m is any given integer such that 
2 < m < n — k. They have shown that the number of rows 
in the smallest parity-check matrix achieving this is at most 
Y-ifn— 1 f n—k—l) 
Z^i=o \ i J " 

Example 1: Let C be the [8, 4, 4] Reed-Muller code. One of 
the parity-check matrices of C is 



/ 1 






1 
1 



V i 



i 
i 




1 

o / 



(22) 



For i = 4, 5, 6, 7, deleting the last 8 — i rows in Hg still gives 
a parity-check matrix for the code C. Table I gives the 
stopping set enumerator S(x) and the dead-end set enumerator 
D(x) for the parity-check matrix H; for i = 4, 5, and 8. 
The table also gives S*(x) and D*(x) corresponding to the 
complete parity-check matrix H*, and the weight enumerator 
A(x) and the incorrigible set enumerator I(x). We point out 
that the matrix H4 is a frequently used full rank matrix for 
this code. For this matrix s = 3. The matrix H5 is the matrix 
proposed by Schwartz and Vardy [13] to achieve s = 4. For 
this matrix S4 > A4. The matrix Hg is constructed based 
on the techniques proposed by Hollmann and Tolhuizen to 
achieve S4 = A4. For later purposes, we also define the matrix 
H14 whose rows are the fourteen non-zero codewords in C 1 - = 
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C of weight four. The stopping set enumerator and the dead- 
end set enumerator for this parity-check matrix are also listed 
in the table. 

III. Dead-End Set Results 

In this section, we investigate parity-check matrices for 
which the iterative decoding procedure achieves optimal per- 
formance, i.e., for which 

Phl(H)=P°E T (C). (23) 

In order to satisfy (1231 . it is necessary and sufficient that the 
dead-end set enumerator equals the incorrigible set enumer- 
ator, i.e., D(x) — I{x). From (12 1> . we know that this is 
the case for the complete parity-check matrix, which contains 
2"~ fc rows. However, from a decoding complexity point of 
view, it may be desirable or required to reduce the number of 
rows in the parity-check matrix. Hence, an interesting research 
challenge is to find a parity check matrix H for code C, with 
a minimum number of rows, but still having D(x) = I(x). 

As stated before, it is shown in [5] that there exists a parity- 
check matrix H with at most "^"^Lq 1 (j 1 ^ 1 ^ rows for which 
Di = Ii, < i < m, for any 1 < m < n — k. By taking 
m = n — k and noticing that Di = Ii for n — k + 1 < i < n 
from (|9jl and d 1 3 1 . we deduce the following result. 

Theorem 2 (Hollmann and Tolhuizen): Let C be an [n, k, d] 
binary linear code with k < n. Then, there exists a parity- 
check matrix with at most 2™~ fc ~ 1 rows for which D{x) = 
I(x). 

Hollmann and Tolhuizen also show that for some codes, and 
in particular for Hamming codes, D(x) ^ I(x) for any parity- 
check matrix with less than 2" _fc_1 rows. However, depending 
on the code, it may be possible to reduce the number of rows in 
a parity-check matrix for which D{x) = I(x) below 2"~ fe ~ 1 
as we show next. 

Theorem 3: Let H be the matrix whose rows are the non- 
zero codewords in C x of weight at most k + 1. Then, H is a 
parity-check matrix for C and for this matrix D(x) = I(x). 

Proof: Let H' be an (n — k) x n parity-check matrix for 
the code C. Then, there is a subset S of {1, 2, . . . , n} of size 
n — k such that is an (n — k) x (n — k) matrix of rank 
n — k. The row space of this matrix contains every unit weight 
vector of length n — k. Therefore, the row space of H' contains 
n — k vectors such that each vector has exactly a single one 
in a unique position indexed by an element in S. Since these 
vectors have weight at most fc+1 and are linearly independent, 
it follows that H, which contains all of them as rows, has rank 
n — k and is indeed a parity-check matrix for C. 

Next, we prove that for this matrix D(x) — I{x), i.e., Di = 
Ii for i — 0,1, ... ,n. From dlOi . (|9j, and dl3> . it suffices to 
show that Di = Ii for 3 < i < n—k. For such an i, assume that 
S' is a subset of {1, 2, . . . , n} of size i which does not contain 
the support of a non-zero codeword. Then, the columns of the 
(n — k) x n parity-check matrix H' indexed by the elements 
in S' are linearly independent. As H' has rank n — fc, there 
is a set S" such that S' C S" C {1, 2, . . . , n} and H^„ is an 
(n — k) x (n — fc) matrix of rank n — k. From the argument 
given in the first part of this proof, H contains n — k vectors 



such that each vector has exactly a single one in a unique 
position indexed by an element in S", and in particular each 
vector has weight at most fc + 1. The existence of any one of 
the i vectors with a single one in a position indexed by an 
element in S' proves that S' is not a stopping set for H. We 
conclude that every stopping set of size i for H contains the 
support of a non-zero codeword. Hence, Di = Ii for all i. ■ 

Let H(x) denote the well-known binary entropy function 
— a:log 2 x — (1 — x) log 2 (l — x) for < x < 1. 

Theorem 4: Let C be an [n, k, d] binary linear code with 
k < n/2 — 1. Then, there exists a parity-check matrix with at 
most 2™ H (( fe+1 )/™) rows for which D(x) = I(x). 

Proof: From the bounds on the sum of binomial coefficients 
as presented on page 310 of [10], it follows that the number 
of codewords in the dual code of weight less than or equal 
to k + 1 is at most equal to 2 nH( - ( - k+1 ' l/n K Hence, the result 
follows from Theorem [3] ■ 

Note that the bound from Theorem |4] improves upon the 
bound from Theorem |2] for low -rate codes. 

IV. Stopping Set Results 

As stated earlier, iterative decoding based on a parity-check 
matrix is optimal, in the sense of having the smallest possible 
unsuccessful decoding probability on the binary erasure chan- 
nel, if and only if D{x) for this matrix is identical to I(x) 
for code C. This holds for the complete parity-check matrix as 
well as other matrices, whose sizes are bounded in SectionlTm 
For D(x) to be identical to I(x), we should have s = d and 
Sd = Ad- Table I shows that it is possible to achieve optimal 
decoding using parity-check matrices, such as Hs, with much 
smaller number of rows than in the complete parity-check 
matrix H*. This is true in spite of the fact that these smaller 
matrices have stopping set enumerators that are different from 
S*(x). We may wonder then what is the effect, if any, of the 
stopping set coefficients Si for i > d on performance. Notice 
that in this paper we defined the probability of unsuccessful 
decoding as the probability that the decoder fails to retrieve 
the transmitted codeword. Although an iterative decoder is 
unsuccessful in case the erasure set is a dead-end set, it still 
succeeds in retrieving those erased bits whose indices do not 
belong to any of the stopping sets contained in the erasure 
set. Therefore, it may be desirable to choose parity-check 
matrices for which Si = Ai not only for i = d but also 
for i > d. Since S* (x) ^ A{x) in Example ^ it follows 
that this is not possible in general. In fact, Theorem [6] will 
show that S*(x) = A(x) only for a rather degenerate class 
of codes. Hence, the best that we may hope for is to have 
parity-check matrices, smaller than the complete parity-check 
matrix to reduce complexity, for which S(x) — S*(x). The 
matrix H14, specified in Theorem [3] is one such matrix for 
the [8, 4, 4] Reed-Muller code. Actually, it can be checked that 
this is the smallest parity-check matrix for this code satisfying 
S(x) = S*(x). 

Example 2: Let C be the [8, 4, 4] Reed-Muller code con- 
sidered in Example ^ From Table I, we notice that the 
iterative decoders based on H*, H14, and Hg achieve the 
smallest possible probability of unsuccessful decoding, while 
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TABLE I 

A(x) AND I(x) FOR THE [8, 4, 4] REED-MULLER CODE AND S(x) AND D(x) FOR THE PARITY-CHECK MATRICES H4, H 5 , Hg, H14, AND H* 





A{x) 


I(x) 




1 + 14a; 4 + x s 


lAx 4 + 56a; 5 + 28a: 6 + 8x 7 + x s 


parity-check matrix 


S(x) 


D(x) 


H 4 


1 + 2x 3 + 24a: 4 + AOx 5 + 28x 6 + 8x 7 + x s 


2x 3 + 32x 4 + 56a; 5 + 28a; 6 + 8a; 7 + a; 8 


H 5 


1 + 18x 4 + 36x 5 + 28a; 6 + 8x 7 + x s 


18a' 4 + 56a; 5 + 28a; 6 + 8a: 7 + x 8 


Hg 


1 + 14x 4 + 24a; 5 + 28a; 6 + 8x 7 + x s 


14a; 4 + 56a' 5 + 28a; 6 + 8a; 7 + x 8 


H14 


1 + 14a; 4 + 28a; 6 + 8x 7 + x s 


14a: 4 + 56a' 5 + 28a; 6 + 8a; 7 + a; 8 


H* 


1 + 14a; 4 + 28a: 6 + 8x 7 + x 8 


14a: 4 + 56a; 5 + 28a: 6 + 8a: 7 + x 8 



the iterative decoders based on H 4 and H5 do not. Although 
H14 is larger than Hg and both achieve the maximum suc- 
cessful decoding probability, there are advantages in using 
H14 instead of H§. For instance, suppose that the erasure 
set is {1,2,3,7,8}. This erasure set is an incorrigible set 
since it contains {1,2,7,8} which is the support of a non- 
zero codeword in the code. Therefore, any decoding method 
fails in retrieving the transmitted codeword. However, iterative 
decoding based on H14 succeeds in determining the erased 
bit C3 from the parity-check equation c 3 + c 4 + C5 + c 6 = 
since (00111100) is a row in H 14 . On the other hand, iterative 
decoding based on Hg does not succeed in retrieving any of 
the erased bits. Actually, since the coefficient of x 5 in the 
stopping set enumerator of H 44 is zero, it follows that if the 
erasure set is a dead-end set of size five, it is always possible 
to retrieve one of the erased bits using H14. This is not true 
if matrix Hg is used instead. 

We will show that the range < i < 2 specified by (II Oi 
for which Si = Ai can be considerably extended in case H is 
the complete parity-check matrix H*. First we start with two 
lemmas. 

Lemma 1: For any non-empty set S C {1,2, ... ,n} which 
does not contain the support of a non-zero codeword, each 
binary vector of length |5| appears exactly 2™~ fe -|'S times as 
a row in HJ. 

Proof: Since S does not contain the support of a non-zero 
codeword, the 2 n ~ k x |<S| matrix Hg has rank |«S|, Hence, 
there are |«S| linearly independent rows in H£. The linear 
combinations of these |<S| rows generate the space of all binary 
vectors of length |<S|, and thus each of these 21 s ' vectors 
appears exactly 2 n ~ k /2> s > times as a row in Hg. ■ 

Lemma 2: For any set S C {1,2, ... ,n} which contains 
exactly one support S' of a non-zero codeword, each binary 
vector of length |<S|, with even weight on the positions indexed 
by S' and any weight on the positions indexed by S \ S', 
appears exactly 2™~ fc -|'S|+i times as a row in H 5 . 

Proof: Since S contains exactly one support S' of a non- 
zero codeword, the 2"~ fe x |<S| matrix H 5 has rank |<S| — 1. 
Hence, there are |<S| — 1 linearly independent rows in H 5 . 
The linear combinations of these |<S| — 1 rows generate the 
space of all binary vectors of length |«S| with even weight on 
the positions indexed by S' and any weight on the positions 
indexed by S\S', and thus each of these 2l s l~ 1 vectors appears 



exactly 2™ fc /2' <s l 1 times as a row in H*< 



Theorem 5: For any code, 
S* = Ai for i = Q, 1, . . . 



min{ r 3d/2l - l,n}, (24) 



i.e., the enumerators S*(x) and A(x) are equal in at least the 
first mm{\3d/2~\,n + 1} coefficients. 

Proof: Since the result is trivial for i = 0, we may assume 
1 < i < min{ [3d/2] — 1, n). Suppose that S is a stopping set 
of size i for H*, which is not the support of a codeword. This 
set S contains at most one support of a non-zero codeword, 
since it follows from the Griesmer bound [10] that any linear 
code of dimension greater than 1 and Hamming distance at 
least d has a length of at least d+ [d/2] = |~3d/2~|. It follows 
from Lemmas ^ an d ID that H^ contains at least one row of 
weight one. Together with (|5Jl, we reach a contradiction to 
the assumption that S is a stopping set for H*. Hence, any 
stopping set of size i for H* is the support of a codeword. 
In conclusion, S* < Ai, and together with Jl 8i we obtain the 
result presented in J24I . ■ 

In the remainder of this section, we give a complete 
characterization of codes that have parity-check matrices for 
which S(x) = A(x), i.e., codes with parity-check matrices 
for which every stopping set is a support of a codeword. For 
convenience, such codes are called minimum stopping. From 
( II (St . we conclude that a code is minimum stopping if and 
only if its optimal stopping set enumerator equals its weight 
enumerator, i.e., S*(x) — A(x). We start by giving three 
classes of codes satisfying this condition. 

(i) lZ n is the [n, l,n] repetition code consisting of the all- 
zero and all-one vectors of length n. From Theorem [5] it 
follows that S* = Ai for i = 0, 1, . . . , n. Hence, Sq = Aq = 
1, S* =A n = l, and S* = Ai = for i = 1, 2, . . . , n - 1. 

(ii) T n is the [n, n, 1} full-code consisting of all binary 
vectors of length n. Clearly, S* — Ai — (™) for i = 

0,1, ...,71. 

(iii) Z n is the [n, 0, 00] zero-code consisting of one code- 
word only, which is the all-zero vector of length n. Since all 
vectors of length n, including those of weight one, belong to 
the complete parity-check matrix of the code, it follows that 
St = Ai = for % = 1, 2, . . . , n, and S$ = A = 1. 

Next, we introduce a useful notation. For i = 1, 2, . . . ,t, let 
d be a binary linear [rii, ki, d%[ block code. Then, we define 



Ci ©C 2 



{(ci, c 2 , . . .,Cf) : Ci G Ci 
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Vi=l,2,...,i}, 



(25) 



i.e., C\ © C 2 © • ■ • © Ct is the set of all sequences obtained by 
juxtaposing codewords in C\ , Ci , . . . , Ct in this order. Clearly, 

Ci © C 2 © • ■ • © C t is an [n x + n 2 H h n t , fci + k 2 H h 

kt, mm{di, dz, . . . , dt}] binary linear block code. 

Finally, recall that two codes are equivalent if there is a 
fixed permutation of indices that maps one to the other [10], 
and that a code is said to have no zero-coordinates if and only 
if there is no index i S {1,2, ... ,n} such that Ci = for 
every codeword (ci, c 2 , . . . , c„). 

Lemma 3: The code C\ ©C 2 © • • • ©Ct is minimum stopping 
if and only if Ci is minimum stopping for all i = 1, . 
Proof: First, notice that the code C — C\ © C 2 © ■ • • 



a block diagonal parity check matrix Hi © H2 ( 
defined by 

/Hi ••• 
H 2 

Hi © H 2 © ■ • ■ © H t 



.,t. 

1 Ct has 
■ ©H t 



(26) 



V ■ ■ • H t / 

where H; is a parity check matrix for Ci. The complete parity- 
check matrix H* of C has all elements of the row space of 
the matrix from i26\ as its rows. 

Next, let S be a subset of {1, 2, . . . , n\ + n 2 + ■ ■ ■ + n t }. 
For i = 1,2, . . . ,t, define 

i-l 

Si = {1 < m < rii : m + rij G S}. (27) 



Then, it follows from d25i that S is the support of a codeword 
in C\ © C 2 © • ■ • © Ct if and only if Si is the support of a 
codeword in C,; for all i = 1, 2, . . . , t. 

Further, S is a stopping set for H* if and only if Si is a 
stopping set for H* for all i — 1,2, ... ,t. This follows from 
the fact that a sequence r is a row in H* if and only if it can 
be written as r — (ri, r 2 , . . . , r t ) where Yi is a row in H* for 
alH = 1, 2, . . . , t. Hence, a sequence r is a row in H^ if and 
only if it can be written as r = (ri, r 2 , . . . , rt) where is a 
row in H* s for alH = 1,2, ... ,t. If for some i = 1,2, ... ,t, 
Si is not a stopping set for H*, then H* s . has a row of weight 
one. Juxtaposing this row with the all-zero rows in H* s , for 
all j ^ i, gives a row in H£ of weight one. This implies that 
S is not a stopping set for H*. On the other hand, if for all 
i = 1,2, ... , f, Si is a stopping set for H*, then for all i, 
H* 5 has no row of weight one. Juxtaposing rows of weights 
other than one yields a row of weight other than one. Hence, 
S is a stopping set for H*. 

We conclude that S is a stopping set for H* which is the 
support of a codeword in Ci © C 2 © • ■ • © C t if and only if, 
for all i — 1,2, ... ,t, Si is a stopping set for H* which is 
the support of a codeword in Cj. Hence, Ci © C 2 © ■ • • © Ct is 
minimum stopping if and only if, for all i = 1, 2, . . . , t, Ci is 
minimum stopping. ■ 

Lemma 4: Let C be a minimum stopping binary linear 
[n, k, d] block code with d > 2 and no zero-coordinates. If 
d = n, then C is lZ n . Otherwise, k > 2 and C is equivalent to 
Hd © C" for some binary linear [n — d, k — 1, d"] block code 
C" with d" > 2 and no zero-coordinates. 



Proof: Up to equivalence, we may assume that C has a 
codeword composed of d ones followed by n — d zeros. In 
particular, the first d columns in any given parity check matrix 
of C are linearly dependent, but no d— 1 columns are such. The 
row space of the submatrix composed of these first d columns 
has dimension d — 1 and a sequence of length d belongs to 
this row space if and only if its weight is even. Therefore, in 
case d = n, C has a full-rank (d— 1) X d parity check matrix 
of the form 



1 



h = h'; 







(28) 



\ 1 ••■ 1 / 

which shows that C is the repetition code of length n. Further, 
in case d < n — 1, C has a full-rank (n — k) x n parity check 
matrix of the form 



H 







H' 
H" 



(29) 



where H' and H" are (d — 1) X (n — d) and (n — k — d + 
1) x (n — d) matrices, respectively. Notice that H" has at least 
one row since d < n — k + 1 by the Singleton bound and if 
equality holds with 2 < d < n — 1, then C is the even weight 
code of length n > 3 which has {1,2,3} as a stopping set for 
H* which is not the support of a codeword. This contradicts 
the assumption that C is minimum stopping. Clearly, H" is 
a matrix of rank n — k — d + 1 since H in i29\ is a full- 
rank matrix. If k = 1, then H" has rank n — d and C has 
zero-coordinates. Therefore, k > 2. To complete the proof, it 
suffices to show that the row space of H' is a subspace of 
the row space of H" since, in this case, by elementary row 
operations, C has a parity-check matrix 

TT" 

" 1 (30) 



H 









H" 



and thus C = IZd © C" where C" is the code with parity check 
matrix H". This code has length n — d, dimension k — 1, 
and Hamming distance d" > d > 2 with no zero-coordinates. 
Now, suppose, to get a contradiction, that the above is not true, 
i.e., the row space of H' is not a subspace of the row space of 
H". Then, the null space of H" is not a subspace of the null 
space of H'. Let c" be a vector of length n — d which belongs 
to the null space of H" but not to the null space of H'. Up to 
equivalence, we may assume that c" is composed of w ones 
followed by n — d — w zeros, where w, 1 < w < n — d, is the 
weight of c". We claim that {1,2, . . . ,d + w} is a stopping 
set for H* which is not the support of a codeword in C. From 
J29L we have 



H 



{1,2,. ..,d+w} 







^■{1,2,. ..,w} 
n {l,2,...,w} 



(31) 



Notice that any nontrivial linear combination of the rows of 
yields a non-zero vector of even weight. Furthermore, 
since c", which starts with w ones followed by n — d — w 



zeros, is in the null space of H" 
ear combination of the rows of K 

weight vector. We conclude that no linear combination of the 



it follows that any lin- 
12 w} yi e lds an even 
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rows of H/j 2 d+w} yields a vector of weight one. Hence, 
{1,2, . . . ,d + w} is a stopping set for H*. Next, notice that 
if {1, 2, . . . , d + w} is the support of a codeword in C, then 
the columns in H/j ,2,. ...d+w} in <I3 1 > should add up to zero. 
From d28i . we know that the first d columns add up to zero. 
Therefore, the columns of H'^ 2 w ^ should add up to the 
zero. However, this cannot be the case as c", which starts with 
w ones followed by n — d — w zeros, is not in the null space 
of H'. In conclusion, we have shown that {1, 2, . . . ,d + w} is 
a stopping set for H* which is not the support of a codeword 
in C. This contradicts the fact that C is minimum stopping. ■ 
Lemma 5: If C is a minimum stopping binary linear [n, k, d] 
block code with d > 2 and no zero-coordinates, then C is 
equivalent to 

© 7^-n 2 © ' ■ " © TCri, , (32) 

for some integers n\, n%, . . . , rit > 2 and t > 1 such that 
ni + n 2 + ■ ■ ■ + n t = n. 

Proof: The lemma trivially holds for all codes of lengths two 
or less. We use induction and assume that it holds for all codes 
of length less than n. From Lemma we know that either C 
is equivalent to lZ n , which is consistent with the statement of 
the lemma, or k > 2 and C is equivalent to TZd © C" for some 
binary linear [n — d, k — l,d"] block code C" with d" > 2 
and no zero-coordinates. From Lemma [5] we know that C" is 
a minimum stopping code. Since C" has length n — d < n, 
it follows from the induction hypothesis that C" is equivalent 
to 7Z mi © • ■ • © 1Z mv , for some integers mi, . . . , m v > 2 and 
v > 1 such that mi + • ■ • + m v = n — d. Then, IZd © C" has 
the same form as given in the lemma. ■ 
Theorem 6: A binary linear [n, k, d] block code C is min- 
imum stopping, i.e., satisfies S* (x) = A{x), if and only if it 
is equivalent to 

7v. ni © 1Zn2 ffi ■ ' ' ffi l^-n u © 3~ nF ffi Z nz , (33) 

for some nonnegative integers m,ri2, ■ ■ ■ ,n u , np,nz and u, 
where n\,n%, . . . ,n u >2 and m+n2 + - ■ ■ + n u + riF + riz = 
n. 

Note: In the theorem, we allow u = in which case C 
is equivalent to T np ffi Z nz . We also allow np = and/or 
riz = 0, in which case the corresponding code with length 
zero disappears from TZ ni ffi TZ n2 ffi • • • ffi K nu ffi T np ffi Z nz . 

Proof: The "if "-part of the theorem follows from Lemma [3] 
and the observations that the S*(x) — A{x) property holds for 
any repetition code lZ ni , the full-code J- np , and the zero-code 

Zn z - 

Next, we proof the "only if'-part of the theorem. Up to 
equivalence, we may assume that C = C @J- np ®Z nz , where 
nz is the number of zero-coordinates of C, np is the number 
of codewords of weight one in C, i.e., the number of all-zero 
columns in any parity check matrix of C, and C is a binary 
linear code of length n — np — nz with d > 2 and no zero- 
coordinates. Here we assume that if n ~ np — nz, np, or 
nz equal zero, then the corresponding code disappears from 
C ®T np ®Z nz . If C' does not disappear, then it can be written 
as stated in Lemma [5] ■ 



V. Conclusion 

In this paper, we examined how the performance of iter- 
ative decoding when applied to a binary linear block code 
over an erasure channel depends on the parity-check matrix 
representing the code. This code representation determines the 
complexity of the decoder. We have shown that there is a trade- 
off between performance and complexity. 

In particular, we have shown that, regardless of the choice 
of the parity-check matrix, the stopping set enumerator differs 
from the weight enumerator except for a degenerate class of 
codes. In spite of that, it is always possible to choose parity- 
check matrices for which the dead-end set enumerator equals 
the incorrigible set enumerator. Iterative decoding based on 
such matrices is optimal, in the sense that it gives the same 
probability of unsuccessful decoding on the binary erasure 
channel as an exhaustive decoder. We presented bounds on the 
number of rows in parity-check matrices with optimal dead- 
end set enumerators, thus bounding the complexity of iterative 
decoding achieving optimal performance. 
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